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Abstract 

Background: For high usability, myo-controlled devices require robust classification schemes during dynamic 
contractions. Therefore, this study investigates the impact of the training data set in the performance of several 
pattern recognition algorithms during dynamic contractions. 

Methods: A 9 class experiment was designed involving both static and dynamic situations. The performance of 
various feature extraction methods and classifiers was evaluated in terms of classification accuracy. 

Results: It is shown that, combined with a threshold to detect the onset of the contraction, current pattern 
recognition algorithms used on static conditions provide relatively high classification accuracy also on dynamic 
situations. Moreover, the performance of the pattern recognition algorithms tested significantly improved by 
optimizing the choice of the training set. Finally, the results also showed that rather simple approaches for 
classification of time domain features provide results comparable to more complex classification methods of 
wavelet features. 

Conclusions: Non-stationary surface EMG signals recorded during dynamic contractions can be accurately classified 
for the control of multi-function prostheses. 



Background 

The myoelectric signals can be non-invasively recorded 
from the skin surface, and represent the electrical activ- 
ity in the muscles within the detection volume of the 
electrodes. They are easy to acquire and have shown to 
be an efficient way to control powered prostheses [1]. 
The control strategy for multi-function prostheses 
widely employs the pattern-recognition approach in a 
supervised way. This approach assumes that different 
types of motion, and thus muscle activations, can be 
associated to distinguishable and consistent signal pat- 
terns in the surface EMG. The patterns are learned by 
the algorithm using some part of the data (learning pro- 
cess), and the algorithm is then used to predict the 
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motions according to further data. The two main steps 
of pattern recognition algorithms are feature extraction 
and classification. First, representative features are com- 
puted from the surface EMG, and then they are assigned 
to classes that represent different motions. Various fea- 
ture extraction methods have been explored, such as 
those involving time-domain features [2], variance and 
autoregressive coefficients [3], or time-frequency based 
features [4]. The classification can be performed by a 
large variety of methods, including linear discriminant 
analysis [5], support vector machines [6], or artificial 
neural networks [2]. With these methods, current myo- 
control systems achieve >95% accuracy in a >10-class 
problem in intact-limbed subjects, and >85% accuracy in 
a 7-class problem in amputee subjects [7]. 

In addition to the classification approach, other meth- 
ods have been developed based on pattern recognition 
using an estimation approach. For example, the hand 
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kinematics can be estimated by training its association 
with the surface EMG of the contralateral limb with an 
artificial neural network [8,9]. Although this approach 
allows training in unilateral amputees, it not suitable for 
bilateral amputees who are the patient group who 
would most benefit from the use of active prostheses. 

The limitations of the current EMG pattern recogni- 
tion algorithms, which are mainly poor reliability and 
need for long training, prevent them from being used in 
clinical situations, in which the signals are not condi- 
tioned as well as in research laboratories. One of those 
limitations is related to the fact that current classifica- 
tion algorithms for EMG pattern recognition are mostly 
tested on stationary or transient scenarios separately. 
Transient surface EMG have been accurately classified 
using the transition as a whole [2], and stationary situa- 
tions (isometric contractions) have been extensively 
investigated in the past decades, showing promising 
classification results [7,10,11]. However, these two situa- 
tions have been always investigated separated, without 
the analysis of performance of an approach of classifica- 
tion of both types of signals concurrently. Therefore, 
this study investigates the performance of several pat- 
tern recognition classification algorithms for surface 
EMG signal classification, as used on static situations, 
when they are applied to dynamic situations, involving 
both static and dynamic contractions. Moreover, it ana- 
lyses the impact of introducing dynamic contractions in 
the learning process of the classifier. 

Methods 

Subjects 

Eight able-bodied subjects (5 males, 3 females; age, 
mean ± SD, 25.3 ± 4.6 yrs) participated in the experi- 
ment. All subjects gave their informed consent before 
participation and the procedures were approved by the 
local ethics committee. 

Procedures 

The experimental protocol focused on a 9-class problem 
involving hand and wrist motions designed for trans- 
radial prostheses. The 9 classes were: wrist flexion, wrist 
extension, forearm supination, forearm pronation, 
thumb close, 4-finger close, making a fist, fingers spread 
open, and no motion (relax). Six pairs of Ag/AgCl sur- 
face electrodes (Ambu® Neuroline 720 Ol-K/12, Ambu 
A/S, Denmark) were mounted around the dominant 
forearm at equal distances from each other, one third 
distal from the elbow joint (Figure 1). The surface EMG 
data were recorded in bipolar derivations, amplified with 
a gain of 2000 (EMG-16, OT Bioelectronica, Italy), fil- 
tered between 47 and 440 Hz, and sampled at 1024 Hz. 
The reference electrode was placed on the non-domi- 
nant forearm. In each experimental session, the subject 




Figure 1 Electrode positions. Schematic views of the position of 

the electrodes: (a) lateral, (b) transversal. 
I J 



was instructed to perform the 9 classes of motion twice, 
in random order. Each contraction was 10 s in duration, 
with 3 s resting periods between consecutive contrac- 
tions. Each subject performed three sessions on the 
same day, with 5-min breaks between the sessions to 
minimize fatigue. The rest periods between contractions 
and sessions were determined according to pilot tests 
and subjective evaluation of the subjects on the fatigue 
level. In total, 54 contractions (6 per class) were per- 
formed by each subject. In each contraction, the subject 
was instructed to start from the rest position, to reach 
the target position in 3 s, to maintain the target position 
for 4 s, and to return to the rest position in 3 s. Thus, 
in each contraction, one segment of static portion (4 s 
in the middle), and two segments of dynamic (aniso- 
tonic and anisometric, representing the two main 
dynamic situations in real movements) portion (3 s at 
each end) were obtained. These dynamic portions con- 
tained the full path between the rest and the target posi- 
tion. No feedback was provided to the subjects to 
regulate the position, but visual validation of the 
motions was performed by the experimenter. A user 
interface was used to provide the subject with the neces- 
sary visual prompt. 

Signal analysis 

The extracted data were segmented in windows of 128 
samples, corresponding to 125 ms, with an overlap of 96 
samples between two consecutive windows (32 samples 
delay between two consecutive windows) and classifica- 
tion was performed for each window. A sampling win- 
dow of 125 ms with a delay of 30 ms has been shown to 
be a good trade-off between decision delay and accuracy 
using the majority vote [12]. The final decision was 
taken by majority vote on the most recent 6 results. The 
response time is the sum of the length of the data used 
to take the decision (approximately 280 ms) and the 
computational time (evaluated between 5 ms and 20 ms 
using a workstation based on an INTEL 17 860 proces- 
sor). These choices make the response time in this 
study acceptable for prosthetic devices, as it is generally 
assumed that a delay shorter than 300 ms is acceptable 
for myoelectric control [13]. For each subject, the signal 
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processing algorithms (see below) were tested using a 
three-fold cross-validation procedure. Two of the three 
data sets were used as learning data and the remaining 
data set as testing data, thus the training was done on 
36 contractions (4 contractions per class) [6]. 

A linear discriminant analysis classifier (LDA) and two 
modes of Support Vector Machine (SVM) classifier with 
Gaussian kernel based boundary were tested. LDA was 
chosen because it is a simple statistical approach with- 
out any parameters to adjust, and has been shown to be 
one of the best classifiers for myoelectric control under 
stationary conditions [10]. The SVM offers a more com- 
plex approach. Depending of the choices of the kernel 
and parameters, SVM can generate a boundary able to 
follow more accurately the trends in the feature space 
on dynamic situations. Although the linear kernel was 
tested on pilot data, its parameter optimization was very 
specific to the training data set, resulting in poor classi- 
fication accuracy. On the other hand, non-linear bound- 
aries showed better performance. The Gaussian kernel 
was used, as it does not depend on a dimension selec- 
tion, but on a regularization parameter, allowing to cre- 
ate a boundary following the trends in the feature space 
without creating a number of small boundaries around 
the outliers. The Gaussian kernel depends on two para- 
meters for the definition of the boundary. The first 
mode of SVM used the One Versus Rest (OVR) 
approach, which separates each class with respect to all 
the others together, and the final decision is obtained by 
selecting the class maximizing the discriminant function. 
The second mode of SVM classifier used the One Ver- 
sus One (OVO) method, which provides a decision for 
each pair of classes, and the final decision is obtained by 
majority vote. Each classifier was trained using learning 
sets of features extracted by one of two methods: Time 
Domain features and Auto Regressive coefficients (TD 
+AR) (as in [10]), which are simple features extracted 
from the signal, and the marginals of the Wavelet 
Transform coefficients (WT) (as in [14]). In preliminary 
studies, the Coiflet wavelet of order 4 has shown the 
best results amongst the different orders of Daubechies, 
Coiflet and Symmlet wavelets, and thus it was selected 
as the mother wavelet in the current study [15]. As for 
the classifiers, those two feature extraction methods 
were selected to compare a rather simple method (TD 
+AR), with a more advanced method (WT). Both meth- 
ods have been successfully applied for myoelectric con- 
trol in static conditions [10,14]. 

Each classifier was trained using five intervals of the 
contractions to study the impact of the training data 
selection as displayed in Figure 2. Four different inter- 
vals (sections) were obtained from the middle of each 
contraction as follows: 4 s (only the static portion), 6 s 
(the static portion and an extra 1 s at each end; 
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Figure 2 Training intervals. Intervals used to train the classifier 
displayed for one contraction along with one channel of surface 
EMG. 
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Dynamicl in Figure 2), 8 s (the static portion and an 
extra 2 s at each end; Dynamic2 in Figure 2) and 10 s 
(the entire contraction). Finally, an additional training 
section was threshold-based (T-B, see below for descrip- 
tion of the threshold algorithm), so that the current 
window was used for training only if its EMG activity 
exceeded the threshold. 

A threshold was applied to each window, comparing 
the activity in the multi-channel surface EMG to a refer- 
ence level taken during the rest. The Teager-Kaiser 
energy operator [16] was used to detect the onset of the 
contractions. For each window, an activity value was 
given to each channel using the Teager-Kaiser operator. 
This value was thresholded by a coefficient multiplied 
by the values obtained at rest. The window was consid- 
ered as active if at least one channel crossed the thresh- 
old. For each subject, the coefficient of the threshold 
was determined on the static portions from the learning 
data. Its value was maximized under the constraints to 
have more than 97% of the windows from all classes 
active, and no less than 85% of the windows from each 
individual class active. These two conditions were deter- 
mined on pilot data and have shown to be consistent 
across the subjects. The threshold for each subject was 
obtained only from the learning data. The threshold 
values were rather different between subjects and chan- 
nels, spanning two orders of magnitude, mainly because 
of the difference in electrode placement and background 
noise. The level of normalized EMG activity during the 
contractions varied between 56% and 92% depending on 
the class. 

The cross-validation procedure was applied to each 
combination of feature set, training section and classi- 
fier. The accuracy was evaluated on the testing set on 
all classes (including the rest class). The classification 
action was performed if the EMG activity in the current 
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window exceeded the threshold obtained from the train- 
ing set. Otherwise the current window was considered 
as belonging to the rest class. 

Results 

Various pattern recognition methods are capable of high 
performance in myoelectric control under static condi- 
tions [11], which was confirmed by a preliminary analy- 
sis of the data in this study. As shown in Figure 3 
without using the threshold, most of the classification 
errors were clustered at the beginning and end of the 
contractions, when the subject was near the rest posi- 
tion. Applying the threshold substantially improved the 
performance by reducing the confusion of the rest class 
with other classes. 

Figure 4 displays the error rate of each pair of feature 
set and classifier when the training was exclusively per- 
formed on the static part of the contractions. Using this 
training set, when combined with a threshold, a simple 
LDA classifier with a TD+AR feature set achieved, on 
average, more than 88% accuracy in dynamic situations. 
The use of a more complex classifier (SVM-OVR) and 
feature set (WT) slightly improved the performance 
(-1% increase in accuracy). Figure 4 also indicates that 
the LDA classifier is more compatible with the TD+AR 
feature set than with the WT feature set. Indeed, the 
use of the marginals, which is a non linear operator, 
reduces the compatibility with the linear nature of the 
LDA. 

Figure 5(a) confirms that LDA does not perform opti- 
mally with the WT feature set. In addition, it shows that 
the combination of LDA with TD+AR features deter- 
mines high performance (error limited to -8%) when 
trained using some part of the dynamic portion in 
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Figure 3 Errors position. Position in time of classification errors 
during contractions, with threshold (black) and without threshold 
(grey). For each window position, the error is expressed as a 
percentage, averaged across subjects and contractions on that 
position. 




Figure 4 Error rates on static training. Error rate (mean and 
standard deviation) of the combinations feature set and classifier 
when training on the static part. 



addition to the static portion. Although the differences 
in performance when using different dynamic sections 
(sections including a portion of the dynamic contrac- 
tion) for training were very low (<0.6%), the best results 
were obtained using the threshold based training sec- 
tion, which provides automatically an efficient way to 
determine which portion of the signals should be used 
as the training set. 

Figure 5(b) shows that the SVM-OVO classifier with 
WT features determines high performance when includ- 
ing the dynamic portions in the training set. An error 
rate of 6.3% was reached when using the entire contrac- 
tion as training section. When using the TD+AR feature 
set, the performance also increased when using the 
dynamic portions for training and reached a 9.7% error 
when using the 8-s training section. Figure 5(c) indicates 
that the performance of the SVM-OVR classifier dete- 
riorates when more dynamic data are included in the 
training set. The OVR mode for SVM creates a bound- 
ary for each class separating it from all the others. 
Including the dynamic portion in the training set 
increases substantially the number of windows available 
for each class, and so the unbalance between the sizes 
of the two classes during the learning process increases. 
This reduces the efficiency of the SVM learning algo- 
rithm, which results in poorly generated boundaries. 

A three way ANOVA was applied on the error rate 
with the algorithm (TD+AR/LDA or WT/SVM-OVO) 
and the training section (5 training sections) as the fac- 
tors and the subject considered as a random variable. 
Only the TD+AR/LDA and WT/SVM-OVO were inves- 
tigated with this analysis since they are the most rele- 
vant combinations, as shown above. The analysis of the 
results revealed a significant effect from both factors 
and from the interaction between them (P < 0.005). 
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Figure 5 Error rates depending on the training section. Performance (mean and standard deviation) of the different combinations of feature 
sets and classifiers (a): LDA; (b): SVM-OVO; (c): SVM-OVR, depending on the training sections as defined in Figure 2. 



Figure 6 represents the significance of the interaction 
between the algorithm and the training section. A Sheffe 
post hoc test was applied to the training section factor, 
for both algorithms separately, to reveal the significance 
levels amongst pairs of training sections. For both algo- 
rithms, the static training section (4 s) showed signifi- 
cantly higher error rate than all the other training 
modalities investigated. However, the 6 s, 8 s, 10 s and 
T-B training sections did not provide significantly differ- 
ent results for any of the two algorithms. 
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Figure 6 Analysis of variance-Interaction. Error rates of the two 
algorithms included in the ANOVA depending on the training 
sections. 



Although the previous results show a significant 
improvement using the dynamic portions for training, 
the inter-subject variability obscures the relative perfor- 
mance across the different training sections. This varia- 
bility is related to two main factors: 

♦ subjects' ability to perform the exact movement fol- 
lowing a cue, 

♦ efficacy of the threshold on the resulting surface 
EMG. 

Therefore, we further define , an index that provides 
a measure of the overall "ability" on the subject i [15]: 



' $a + $a + S« + Sin + St 



Where each s l x is the error rate for the subject i using 
the training section with a length of x (T is for Thresh- 
old-based). We then normalize the error for each train- 
ing section with respect to the overall index of ability 
for each subject: 



E 



Ei 



These normalized errors reveal the relative perfor- 
mances of the training sections, and allow the results 
for each subject to be displayed on the same scale. Fig- 
ure 7 depicts the mean across subjects of the normalized 
errors for each training section, as well as the results for 
each subject. The relative performance of the training 
sections confirmed the trend of the non-normalized 
error observed in Figure 4, and the individual represen- 
tations are in most cases well clustered around the 
mean for each training section. 
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Figure 7 Normalized errors. The normalized errors depending on 
the training section for the TD+AR/LDA algorithm (a) and the WT/ 
SVM-OVO (b). 



A one way ANOVA was applied on the normalized 
errors for each algorithm using the training section as 
factor. In both cases, the results confirmed that the effect 
of the training section was significant. A Sheffe post hoc 
test was applied on these results and confirmed the pre- 
vious results for the TD+AR/LDA algorithm. For the 
WT/SVM-OVO algorithm, the post hoc test revealed sig- 
nificant differences between the training sections, divid- 
ing them in three groups (section 8 s and 10 s; section 6 
s and T-B; Static section). Table 1 summarize all results. 

Discussion 

The results of the study show that, using a threshold to 
detect the onset of the motion, surface EMG during 
dynamic tasks can be classified with accuracy compar- 
able to that obtained in static situations, when the train- 
ing section is properly selected (Table 1). 

Including some dynamic portions (6 s, 8 s, 10 s, T-B) 
of sEMG during the learning process significantly 
improved the performance of both LDA and SVM based 
algorithms compared to the static training (4 s). The 
inferior performance of the SVM-OVR classifier when 
dynamic portions are included in the training set is not 
likely related to the inclusion of the dynamic part. 
Rather, it is more likely due to the unbalance of size 
during the learning process, i.e. a 1 to 8 ratio between 
one class compared to all the others together. Reducing 
the number of samples taken for the elements of the 
biggest class during learning could solve this issue, but 
would require an additional step, and an optimization of 
the samples to select, which is beyond the scope of this 
study. 

Although the best results were obtained using the pair 
WT/SVM-OVO (6.3% ± 3.3% error), the disadvantage of 
this combination is the relatively high requirement in 
terms of optimization. Indeed, the SVM requests at least 
one penalization parameter, and in case of non-linear 
boundary two parameters which must be optimized. In 
addition, this study shows that the optimization of the 



Table 1 Results summary 





LDA 




SVM ovo 


SVM ovr 


Training Data sections 


TD 


WT 


TD 


WT 


TD 


WT 


Stationary: 4 s 


11.9 ± 5.38 


16.7 ± 6.72 


12.3± 5.47 


10.9 ± 5.41 


12.3 ± 5.61 


10.9 ± 5.09 


Dynamic 1:6 s 


8.84 ±4.13 


15.3 ± 6.53 


9.10 ±4.22 


7.37 ± 3.72 


21.1 ± 6.49 


23.7 ± 7.35 


Dynamic 2: 8 s 


8.00 ± 3.79 


13.3 ± 6.11 


9.75 ± 4.03 


6.34 ± 3.53 


41.3 ± 7.65 


23.9 ± 8.06 


All 10 s 


8.03 ± 3.82 


12.2 ± 5.70 


16.4 ± 4.92 


6.26 ± 3.44 


44.4 ± 7.00 


23.6 ± 7.51 


Threshold 


7.87 ± 3.70 


15.3 ± 5.91 


9.19 ± 3.58 


6.93 ± 3.55 


21.5 ± 12.5 


20.2 ± 8.55 



Summary of the results, with the average error rate across all the subjects depending on the feature extraction method, the classifier, and the training section. 
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training section has a great impact on the performance. 
Unfortunately, the effect of these factors seemed to have 
interaction, thus they have to be optimized together. 
This increases significantly the time required to train 
the algorithm and the amount of data required for 
training. 

On the other hand, the combination TD+AR/LDA 
showed a good performance (8.0% ± 3.5% error), and it 
does not require any optimization. Moreover, this study 
showed that this combination is much less sensitive to 
the training section compared to the WT/SVM-OVO 
combination, and that it reaches its optimal perfor- 
mance if some dynamic portions are included in the 
learning process. This shows that the selection of the 
training section in that case can be done automatically, 
by taking the entire contraction as training, or by using 
a threshold in activation. This results in a completely 
automated algorithm, that can be trained within a short 
period of time, and adapted to each patient using the 
threshold selection. Therefore, this combination is more 
suitable for clinical applications in which the training 
must be kept as short as possible. Interestingly, this 
combination of features and classifier has also shown to 
be the best suitable real-time myoelectric classification 
algorithm under static conditions [12]. 

In addition to the focus on classification, this study 
also presents a method for movement onset detection. 
The results presented depend on the accuracy of this 
method. The threshold was adapted individually, and 
applied identically for each investigated algorithm. 
Therefore, the impact of threshold selection on the 
relative performances of these algorithms is minimal. 
This approach aimed to simulate the clinical situations 
(i.e., one or more fixed thresholds per recording site) 
so that results obtained are as consistent as possible 
with what one would expect in real applications. The 
main result of the current study is that the relatively 
simple TD+AR/LDA approach maintains relatively 
high performance under the dynamic conditions tested. 
This result was obtained on healthy subjects. Further 
investigations will involve amputee patients as end- 
users of the system. According to previous work [7], it 
is expected that the results of this study will translate 
to patients, potentially with a decrease in the overall 
accuracy. 

Finally, it is important to notice that this study 
focused on the transitions between various movements 
and the rest position. Further optimization could be 
achieved by involving the transitions between all the 
combinations of active classes in the learning process. 
This would however increase the amount of training 
data and training time significantly making it impractical 
for clinical applications. Thus, a classifier less sensitive 
to such kind of training requirements as well as 



methods to decrease the retraining requirements of the 
algorithms should be further investigated. This remains 
a challenge for the ongoing studies along with propor- 
tional and simultaneous control. 

Conclusions 

The dynamic portions of EMG signals are important for 
real myocontrol systems and thus must be included in 
the learning process in order to achieve an overall high 
classification accuracy. When the learning set is properly 
chosen, rather simple pattern recognition approaches 
provide similar classification accuracies for dynamic as 
for static situations. 
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